Intonational phrases for speech summarization
نویسندگان
چکیده
Extractive speech summarization approaches select relevant segments of spoken documents and concatenate them to generate a summary. The extraction unit chosen, whether a sentence, syntactic constituent, or other segment, has a significant impact on the overall quality and fluency of the summary. Even though sentences tend to be the choice of most the extractive speech summarizers, in this paper, we present the results of an empirical study indicating that intonational phrases are better units of extraction for summarization. Our study compared four types of input segmentation: sentences, two pause-based segmentation, and intonational phrases (IP). We found that IPs are the best candidates for extractive summarization, improving over the second highest-performing approach, sentence-based summarization, by 8.2% F-measure.
منابع مشابه
Pitch patterns of intonational phrases and intonational phrase groups in native and non-native speech
We examined pitch patterns within and across intonational phrases of Japanese read aloud by native and non-native (Mandarin Chinese) speakers. Japanese speakers change pitch ranges for each intonational phrase. The relative pitch ranges of neighboring intonational phrases indicate which intonational phrase belongs to which intonational phrase group. Chinese speakers are unable to acoustically c...
متن کاملSyntactic and prosodic parenthesis
This paper examines the view that parentheticals obligatorily form an intonational phrase and break up the intonational phrase of the matrix sentence into two intonational phrases. The analysis of spontaneous speech data of Hamburg German shows that neither do all parentheticals form a distinct intonational phrase nor do all parentheticals break up the intonational phrase of the matrix sentence...
متن کاملModeling spontaneous speech events during recognition
In spontaneous speech, speakers segment their speech into intonational phrases, and make repairs to what they are saying. However, techniques for understanding spontaneous speech tend to treat these events as noise, in the same manner as they handle out-of-grammar constructions and misrecognitions. In our approach, we advocate that these events should be explicitly modeled. We modify the speech...
متن کاملProsody in a corpus of French spontaneous speech: perception, annotation and prosody ~ syntax interaction
Our study focuses on the issue of prosodic annotation and of the prosody ~ syntax interface in conversation and is based on a large corpus of conversational speech in French. The results of inter-transcriber agreement tests show that two expert transcribers are consistent in their labeling of prosodic phrasing and the consistency is well above the chance. A qualitative analysis reveals transcri...
متن کاملLength, ordering preference and intonational phrasing: evidence from pauses
This paper reports a speech production experiment in which the effects of surrounding phrase lengths and head-argument distance on intra-sentential pause duration were tested. While the results confirm an effect of phrase length on pausing, this effect is found to be distinctly stronger for long phrases preceding the pause than for long upcoming phrases. The results are discussed with respect t...
متن کامل